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ABSTRACT 

The structure-specific endonuclease activity of the 
human XPF-ERCC1 complex is essential for a 
number of DNA processing mechanisms that help 
to maintain genomic integrity. XPF-ERCC1 cleaves 
DNA structures such as stem-loops, bubbles or 
flaps in one strand of a duplex where there is at 
least one downstream single strand. Here, we 
define the minimal substrate requirements for 
cleavage of stem-loop substrates allowing us to 
develop a real-time fluorescence-based assay to 
measure endonuclease activity. Using this assay, 
we show that changes in the sequence of the 
duplex upstream of the incision site results in up 
to 100-fold variation in cleavage rate of a stem- 
loop substrate by XPF-ERCC1. XPF-ERCC1 has a 
preference for cleaving the phosphodiester bond 
positioned on the 3 -side of a T or a U, which is 
flanked by an upstream T or U suggesting that a 
T/U pocket may exist within the catalytic domain. 
In addition to an endonuclease domain and 
tandem helix-hairpin-helix domains, XPF has a di- 
vergent and inactive DEAH helicase-like domain 
(HLD). We show that deletion of HLD eliminates 
endonuclease activity and demonstrate that 
purified recombinant XPF-HLD shows a preference 
for binding stem-loop structures over single strand 
or duplex alone, suggesting a role for the HLD in 
initial structure recognition. Together our data 
describe features of XPF-ERCC1 and an accepted 
model substrate that are important for recognition 
and efficient incision activity. 



INTRODUCTION 

The human XPF-ERCC1 complex is a metal-dependent 
structure- specific endonuclease which is required for a 
number of pathways essential for genome maintenance 
(1). It has roles in several DNA repair processes including 
nucleotide excision repair (NER), interstrand crosslink 
(ICL) repair, double-strand break (DSB) repair and 
recombinational repair (2-5). Mutations in XPF (xero- 
derma pigmentosum complementation group F) in 
humans gives rise to a defect in the S'-incision step of 
NER of bulky adducts in one strand of DNA and results 
in the UV sensitive cancer-prone syndrome xeroderma 
pigmentosum. A more severe manifestation of an XPF 
defect led to progeroid syndrome, a premature ageing 
disease with a similar phenotype to both XPF and 
ERCC1 knockout mice (6-8). ERCC1 (excision repair 
cross complementation group 1) was first identified as a 
human gene that restored the DNA repair capability to a 
UV- sensitive Chinese hamster ovary cell line (9). One case 
of a defect in ERCC1 in humans has been identified to date 
which gave rise to congenital cerebro-oculo-facio-skeletal 
syndrome (COFS) (10), a condition usually associated with 
a defect in repair of transcribing genes [for review see (1 1)]. 

The XPF and ERCC1 polypeptides both belong to the 
highly conserved Mus81-XPF structural family of 
proteins (12-15) possessing a common core containing a 
nuclease or nuclease-like domain and tandem C-terminal 
helix-hairpin-helix (HhH) 2 domains (Figure 1A) which 
bind DNA (16-19). Human XPF also has a highly diver- 
gent helicase-like domain (HLD) which has apparently 
lost residues required for ATP hydrolysis (Figure 1A) 
(20) and may be involved in protein-protein interactions 
or DNA binding. XPF binds to SLX4, a component of 
the SLX4-SLX1 endonuclease complex, which recruits 
multiple structure- specific endonucleases to 
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Figure 1. Minimum substrate requirements for incision activity. (A) Top: domain organization of the human XPF-ERCCl complex; bottom left 
panel: gel-filtration profile of XPF-ERCCl expressed in E. coli after purification on nickel-agarose, and heparin-Sepharose. XPF-ERCCl eluted 
where indicated. Fractions 7-8 is the void and the peak at 11ml corresponded to a MW of ~250kDa, and 13 ml to 35kDa. Inset shows 12% SDS- 
PAGE gel of fraction 12 stained with Coomassie blue. XPF migrates at 115kDa and ERCC1 at 36kDa. Bottom right panel: sequencing gel showing 
cleavage of 200fmol stem-loop substrate by wild-type XPF-ERCCl and XPF- D731N -ERCC1 mutant. Lanes 1 and 5 contained protein buffer from 
each preparation of protein, lanes 2-4 contained 12, 24 and 48 nM XPF-ERCCl, respectively. Lanes 6-8 contained 12, 24 and 48 nM XPF- D731N - 
ERCC1. (B) Sequencing gel showing cleavage of stem-loops with either 6 8, 10 or 12 bp duplex stems. Wells contained protein buffer (— ) or 42.6 nM 
XPF-ERCCl (+) in buffer containing either 50 or 100 mM NaCl as indicated. Cleavage of the 12-bp duplex yields products of 10 and 8 bp, and the 
10-bp duplex is cleaved at one major site yielding an 8-bp product. Dual uncleaved bands in lanes 1-8 are full-length (FL) and FL-1 nucleotide 
oligonucleotides produced during synthesis of the shorter 6 and 8 bp substrates. (C) Separation of products of cleavage reaction on a 10% denaturing 
polyacrylamide gel. Stem-loops (100 nM) labelled 5' with Cy-5 with a phosphorothioate bond replacing the phosphodiester bond 3 ; to the base 
represented in lower case in the sequences 5 / -CAGCGCTCGG-3 / (TC), 5 / -CAGCGCTcGG-3 / (Tc), and 5 -CAGCGCtCGG-3 (tC) in the top strand 
of the duplex were incubated with 42.6 nM XPF-ERCCl for 1 hour. (D) Ten percent sequencing gel showing incision reactions containing stem- 
loops with single-strand loops of increasing T loop size as indicated. An amount of 400 fmol labelled stem-loop substrates were incubated with buffer 
only (— ) or 42.6 nM XPF-ERCCl (+). (E) Sequencing gel showing incisions made by XPF-ERCCl on stem-loop, splayed arm, 3'-overhangs, 
S'-overhangs and duplex. The sequence of the duplex portion of the structure is 5 / -CAGCGCTCGG-375 / CCGAGCGCTG-3 / and single-strand 
regions are 20T for stem-loop and 10T for splayed arms/overhangs. The top strand of each structure was labelled 5 ; as indicated and oligonucleo- 
tides were annealed and purified at 15°C as described in experimental procedures. The cleavage product is the same eight-base oligo for each 
structure as indicated in the figure. 
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recombination and repair processes in the cell (21-24) but 
the interacting region on XPF has not been identified to 
date. Core catalytic and (HhH) 2 domains of archaeal 
XPFs homodimerize and bind DNA asymmetrically 
such that only one catalytic site can be active at a time 
(16,17). Eukaryotic XPFs have diverged from this ar- 
rangement and form obligate heterodimers with structur- 
ally related, catalytically inactive partners via their 
C-termini (13). The catalytic motif resides in the XPF 
protomer of human XPF-ERCC1 (25) and ERCC1 
instead binds XPA for recruitment to the NER process 
(26,27) (Figure 1A). In humans, members of the XPF 
family also include Mus81-Emel/Eme2 and FancM/ 
FAAP24, although the latter has lost essential residues 
for nuclease activity but has retained an active 
N-terminal translocase domain (28). 

XPF-ERCC1 cleaves one strand of DNA a few bases 
into a duplex upstream from a junction, containing at 
least one downstream single strand. These structures 
arise in the cell during DNA processing and repair in 
the form of bubbles, stem-loops, splayed arms, 3 / -over- 
hangs, S'-overhangs and 3 / -flaps. In contrast, 
Mus81-Emel/Eme2 shares the substrate specificity dis- 
played by the archaeal homodimeric XPF homologues 
and cleaves junctions with a downstream duplex including 
nicked Holliday junctions, forks, 3 / -flaps and D loops (29). 

In order to understand more about the interaction of the 
full-length recombinant human XPF-ERCC1 complex 
with its substrate we have established the minimum DNA 
junction requirements for incision on a generally accepted 
stem-loop model substrate (2,19,25). We have used this in- 
formation to develop a fluorescence-based assay to measure 
the kinetic constants of the incision reaction at a single 
incision site. Using this assay, we show that the duplex 
sequence at the incision site influences the incision rate 
and XPF-ERCC1 prefers to cleave the phosphodiester 
bond on the 3'-side of a T which is flanked on the 5'-side 
by a T or an A. Substitution of these bases with deoxy-uracil 
paired with A increased the rate of incision by <5-fold, sug- 
gesting that a pocket exists in the nuclease domain which 
accommodates pyrimidines, and the exocyclic groups 
present on C and T inhibit entry into the pocket. We then 
demonstrate that the HLD of XPF is necessary for activity, 
in contrast to a previous study which used a different trun- 
cation mutant (19). We show that recombinant purified 
XPF-HLD preferentially binds the stem-loop structure 
implicating a role for the XPF-HLD in structure recogni- 
tion and stabilization of the core domains at the junction. 



MATERIALS AND METHODS 

DNA substrates 

PAGE-purified DNA oligonucleotides were obtained 
from Sigma with the following sequences: 

Steml2-loop20: 5 / -GCCAGCGCTCGG(20T)CCGAGC 
GCTGGC-3'; 

StemlO-loop 20:5 / -CAGCGCTCGG(20T)CCGAGCGC 
TG-3'; 

Stem8-loop20:5 / -GCGCTCGG(20T)CCGAGCGC-3 / 



Stem6-loop 20: 5 / -GCTCGG(20T)CCGAGC-3 / ; 
Control stemlO-loop 2:5 / -CAGCGCTCGGTTCCGAGC 
GCTG-3'; 

Fl: 5 / -CAGCGCTCGG-3 / ; F2: 5 / -CCGAGCGCTG-3 / ; 
F3: 5 / -CAGCGCTCGG(l 1T)-3';F4: 5'-(l 1T)CCGAG 
CGCTG-3 / . 

For fluorescence polarization: 

15-mer single strand: 6-FAM-5 / -GCGCTCGGCGTTTT 

T-3'; 2T stem-loop: 6-FAM 5-CAGCGC 
TCGG(2T)CCGAGCGCTG-3 / 

Other sequences were used as described in the results. 

Proteins 

Human XPF-ERCC1 wild-type and mutant proteins were 
produced in Escherichia coli strain FB810 BL21 (DE3) 
using a dicistronic expression plasmid derived from 
pET30b. ERCC1 (residues 1-297) was expressed first with 
a polyhistidine tag (His6) at the C-terminus. XPF (residues 
12-916) had a His-6 at the N- terminus. Recombinant 
protein was extracted from harvested E. coli cells in 
20 mM HEPES/KOH pH 7.4 containing 500 mM NaCl, 
2mM (3-mercaptoethanol, 2mM MgCl 2 , 10% glycerol, 
0.01% CHAPS, 5mM imidazole and protease inhibitors. 
The resulting extract was incubated with Ni-NTA resin for 
30 min, poured into a disposable column and washed with 
extract buffer containing 40 mM imidazole. XPF-ERCC1 
was eluted in buffer containing 500 mM imidazole. 
Fractions from the Ni-NTA column were dialysed into 
heparin buffer; 20 mM HEPES/KOH pH 7.4 containing 
50 mM NaCl, 2mM (3-mercaptoethanol, 10% glycerol, 
0.01% CHAPS and 0.5 mM EDTA before concentrating 
and loading onto a Hi-trap heparin column and eluted in 
heparin buffer containing 500 mM NaCl. Fractions con- 
taining XPF-ERCC1 were concentrated to 0.5 ml and 
loaded onto a Superose 12 column (GE Healthcare) previ- 
ously equilibrated in 20 mM HEPES pH 7.4 containing 
50 mM NaCl, 0.5 mM EDTA, 0.5 mM DTT, 2mM 
MgCl 2 10% glycerol and 0.01% CHAPS. Fractions from 
the Superose 12 column were stored at — 80°C in 10 mM 
HEPES pH 7.4, 2.5 mM (3-mercaptoethanol, 50% glycerol, 
0.25 mM EDTA and 25 mM NaCl. 

The A 640 XPF-ERCC1 truncation mutant was purified 
by the same method as wild-type XPF-ERCC1, except 
that the pooled fractions from the Ni-NTA column were 
diluted 1:20 in heparin buffer before loading onto the 
Hi- trap heparin column. The gel filtration was carried out 
using a 16/60 prep grade Superose 12 column. The 
XPF D73 i N -ERCCl mutations were made using Quik- 
change site-directed mutagenesis kit (Stratagene) and the 
above WT construct as template, expressed and purified 
using exactly the same procedure as wild-type. XPF- 
HLD was produced in E. coli strain FB810 using vector 
derived from pET41b (Novagen) containing an 
N-terminal GST tag and a 3C protease cleavage site. A 
stop codon was introduced after A672 in XPF. Cells were 
induced with IPTG at a concentration of 50 uM for 24 h at 
18°C. Protein was extracted from harvested E. coli cells in 
50 mM Tris pH8 containing 500 mM NaCl, 1 mM DTT, 
10% glycerol, 0.1% TX-100, and the protease inhibitors 
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AEBSF, benzamidine and Complete inhibitor cocktail 
(Roche). Glutathione beads (Pierce) were added and 
incubated for 1 h at 4°C. Protein was cleaved with 3C 
protease and loaded onto a Superose 12 column previously 
equilibrated in elution buffer. Fractions were dialysed into 
elution buffer containing 50 mM NaCl and stored at 
— 80°C. Protein measurements were carried out using the 
Bio Rad protein assay kit (Bio-Rad). 

Gel-based nuclease assays 

Stem-loops and oligonucleotides Fl and F3 (sequences 
described above and used to make the structures in 
Figure 3E as described below) were labelled on the 
S'-terminus with [y 32 P]ATP using T4 polynucleotide 
kinase (NEB) for 30min at 37°C, and the reaction 
stopped by heating to 95°C for 3min. Fl was mixed with 
2 M excess of F2 for double-strand (duplex), and F4 for 5'- 
overhang. F3 was mixed with F2 for 3 / -overhang, and F4 
for splayed arm. 

The DNA was annealed by heating to 95°C for 15min, 
followed by 60°C, 37°C, and 25°C for 10 min each, and kept 
on ice before purifying on a 10% non-denaturing polyacryl- 
amide gel in TBE (Tris, borate, EDTA electrophoresis 
buffer) and diffusion overnight at 4°C into T.E. (10 mM 
Tris, 1 mM EDTA) containing 50 mM NaCl, and stored 
at — 20° C. Reactions were carried out in a volume of 20 ul 
at 25° C for 90 min in reaction buffer containing 50 mM 
Tris pH 8, 20 mM NaCl, 0.5 mM DTT (dithiothreitol), 
10% glycerol, O.lmg/ml bovine serum albumin (BSA) 
and 0.75 mM MnCl 2 (unless indicated). Incision products 
were separated on a 12% sequencing gel (Sequagel- 
National Diagnostics) for 2h. The gel was removed and 
dried and products visualized by auroradiography, or on 
a STORM phosphorimager (Molecular Dynamics). 

Fluorescent gel-based nuclease assays contained 100 nM 
oligonucleotide (obtained from Sigma) labelled on the 5'- 
terminus with 6-FAM (an isomer of carboxyfluorescein) or 
Cy5 (fluorescent dye from the cyanine family). These were 
incubated for lh with 12nM XPF-ERCC1 in reaction 
buffer. Products were separated on a 12% sequencing gel. 
One plate was removed from the gel and the fluorescence 
measured directly on a STORM phosphorimager (blue 
channel for 6-FAM and red channel for Cy5). 

Microplate fluorescence incision assay 

Polyacrylamide gel electrophoresis (PAGE)-purified 
oligonucleotides labelled 5' with 6-FAM and 3 / -with 
dabcyl ([4-((4-(dimethyllamino) phenyl)azo)benzoic acid) 
(pre-labelled molecular beacons from Sigma). These were 
diluted to 100 uM in T.E containing 50 mM NaCl and 
stored at — 20° C. After thawing, substrates were annealed 
by heating to 95°C for 1 min, and cooling to room tempera- 
ture. Reactions contained the appropriate concentration of 
substrate, 50 mM Tris pH 8, 20 mM NaCl, 0.5 mM DTT, 
10% glycerol and 0.75 mM MnCl 2 (unless indicated) in a 
total volume of 20 ul. XPF-ERCC1 was thawed immedi- 
ately before use and diluted in protein storage buffer for 
assay. Reactions were carried out in 384-well black, 
flat-bottomed microti tre plates (Corning 3854) and 
readings were taken in a Tecan Safire II fluorimeter 



(Tecan group LTD) using Megellen software at an excita- 
tion and emission wavelengths of 483 and 525 nm, respect- 
ively. For kinetic runs, 30 readings were collected at 2-min 
intervals. 

Kinetics 

The rate of increase in fluorescence units produced by the 
enzyme per minute during the initial phase of the reaction 
(0-10 min) was measured in triplicate. The F max and K m 
values were calculated using Sigmaplot software (Systat 
Software Inc) and the Michaelis-Menten equation: 

Y= V m ^x/K m + x 

where x = substrate concentration (nM) and Y = reaction 
velocity (relative fluorescence units/min). 

Femtomol (fmol) product release was quantified by 
plotting the relative fluorescence units (RFU) produced 
by known amounts of the oligonucleotides 6-FAM-5 / -C 
AGCGCTC and GG(20T)CCGAGCGCTG-3 / -dabcyl 
corresponding to the cleavage products. A standard 
curve and predicted fmol cleavage product released was 
calculated from this. 

Fluorescence polarization 

Assays were carried out and equilibrium dissociation con- 
stants (Kj)) calculated using PRISM (Graphpad Software 
Ltd) as described in (28). Reactions contained 5 uM DNA 
substrate and 0.005% Tween-20. The oligonucleotides 
used were: StemlO-loop 20 (above with sequence substi- 
tutions as described in the text), or 15T single stranded 
oligonucleotide labelled 5' with 6-FAM, or indicated in 
the text. The binding reactions were incubated for 
30 min at 25°C and read in a Tecan Saffire 2 fluorimeter, 
using Megellen software (Tecan group Ltd). 

Biacore surface plasmon resonance 

Experiments were carried out on a T200 Biacore analyser 
(GE Healthcare). PAGE-purified biotinylated oligonucleo- 
tides were obtained from Sigma, diluted to lOnM in T.E 
containing 50 mM NaCl and bound to a Series S Sensor SA 
sensor chip (GE Healthcare) according to manufacturers' 
instructions at a flow rate of 10(il/min to give a final 
theoretical i? max of 100 after analyte binding. XPF- 
ERCC1 superose 12 fractions were pooled and dialysed 
overnight into 10 mM HEPES, 50 mM NaCl, 0.5 mM 
DTT and 0.05% P20 (GE Healthcare) surfactant. Single 
cycle kinetic analysis was carried out according to the 
Biacore software using 2-fold dilutions of XPF-ERCC1 
in the above buffer. The data from a reference cell 
without DNA was subtracted and the apparent association 
(K a ) and dissociation (K d ) rate constants were measured 
using the equation for 1:1 Langmuir binding (BIA 
Evaluation 4.1 software). The apparent equilibrium dis- 
sociation constants (Xd) were calculated from these values. 

RESULTS 

Minimum substrate requirements for incision activity 

Previous attempts to purify full-length recombinant human 
XPF-ERCC1 from E. coli resulted in aggregated protein 
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consisting mainly of ERCC1, and only a small amount of 
active heterodimer (25). We found that the addition of 
0.01% CHAPS to the buffers considerably increased the 
yield of soluble, heterodimeric XPF-ERCC1 (Figure 1A). 
After purification on Ni-agarose, heparin sepharose and 
Superose 12-gel filtration the yield was 100-150 jug of pure 
ERCC1-XPF per 2 1 of culture (Figure 1A). This protocol 
separated high and low molecular weight bacteria-derived 
nucleases from the XPF-ERCC1 heterodimer (Supple- 
mentary Figure S2), (Figure 1 A, bottom left). Pooled frac- 
tions 10-12 incised a stem-loop substrate in 7.5 mM MnCl 2 
(Figure 1A, bottom right and Supplementary Figure S2, 
lanes 6 and 8). The catalytically impaired mutant 
XPF- D731N -ERCC1 (25,30,31) was prepared using 
exactly the same protocol to control for the presence of 
bacterial nuclease contamination and showed very little 
specific cleavage (Figure 1A) (Supplementary Figures 
SI A and S2: lanes 18 and 20). This is an important 
control as other nucleases can mimic structure specific 
activity (Supplementary Figure S2, lanes 25-30). 

We used gel-filtered recombinant XPF-ERCC1 to define 
the minimal substrate requirements for making incisions at 
a single defined position in the duplex using a gel-based 
assay. The generally accepted model XPF-ERCC1 
substrate used in previous studies is a stem-loop substrate 
consisting of a 12-bp stem and a 22-base single-strand loop 
(2,19,25). We reduced the duplex length from 12, to 10, 8 
and 6 bp. All of these substrates were gel purified by 
non-denaturing PAGE before use, which ensured that 
they formed the stem-loop structure at 50 mM NaCl. 
Two major incisions were made within the 12-bp duplex 
(Figure IB, lanes 15 and 16) which have been previously 
mapped to the phosphodiester bond 3' to the third and fifth 
bases upstream from the duplex single-strand junction 
(2,32,33). We found that reduction of the duplex to 10 bp 
by removing the two pairs at the free end of the duplex 
resulted in only one incision and this yielded a product of 
eight bases (Figure IB, lanes 11 and 12). The incision 
position was verified by replacing the cleaved phospho- 
diester bond with a non-cleavable phosphorothioate link 
in which a non-bridging oxygen (PO) is replaced by 
sulphur (PS), and this prevented incisions only when it 
was placed 3' to the third base upstream from the duplex 
single-strand junction but not at the fourth (Figure 1C). No 
cleavage occurred on the substrates containing 6- or 8-bp 
stems (Figure IB, lanes 4, 5, 7 and 8). We therefore had 
produced conditions for a single incision within the 
stem-loop substrate. Catalytic activity was reduced at 
lOOmM NaCl indicating that DNA binding by the 
enzyme may be reduced at higher salt, or the increase in 
duplex stability inhibits the enzyme. 

The minimal single-strand length was examined using the 
10-bp duplex stem-loop substrate. The loop consists solely 
of Ts to avoid internal base pairing which may occur if 
mixed sequences are used, or tracts of Gs and Cs, which 
may base pair with the GC-rich duplex. The loop was 
reduced to 20, 10, 8, 6, 4 and 2 Ts (Figure ID). The 
minimum single-strand loop requirement was between 4 
and 6 thymidines and this confirms a previous finding 
where the single-strand loop was reduced on substrates 
with concomitantly increasing duplex regions (32). 



Incision activity increased up to a loop of 20 Ts and 
consequently the 10 duplex-20 loop structure incised at a 
single site by XPF-ERCC1 was used for all subsequent 
activity assays. 

Oligonucleotide structures were made with the identical 
10-bp duplex and either 5' or 3' 10-T single-strand over- 
hangs, or a splayed arm containing both. XPF-ERCC1 
cleaved all the structures at exactly the same position 
in the duplex, yeilding an 8-base labelled incision product, 
(Figure IE, lanes 1-4) irrespective of the position of 
the single-strand region. A S'-single-strand arm on the 
cleaved strand is not required for cleavage when a 5'- 
single-strand arm is present on the uncleaved strand and 
vice versa. We found that duplex DNA is not cleaved at 
all under these conditions (Figure IE, lanes 9 and 10). 

Kinetics of XPF-ERCC1 incision activity 

We then developed a real-time fluorescence-based micro- 
ti tre plate assay using the 10-duplex 20-T single-strand 
stem-loop model substrate (Figure 2 A, left). This was 
labelled on the S'-termini with 6-FAM and 3' with the 
quencher, dabcyl. In the annealed state, the dabcyl 
quenches the fluorescent signal. The 10-bp duplex region 
of this stem-loop has a melting temperature of 29° C at 
20 mM NaCl. Cleavage in the 10-bp duplex region by 
XPF-ERCC1 generates an eight-base oligonucleotide, 
which has a melting temperature of ~23°C at 20 mM 
NaCl that diffuses into solution at the incubation 
temperature of 25° C, resulting in an increase in fluorescent 
signal with time (Figure 2 A, right). An amount of 1.5 nM 
(30fmol) of XPF-ERCC1 gave reproducible results with a 
low signal/noise ratio and the incision rate (relative fluores- 
cence units (RFU)/min) increased with increasing XPF- 
ERCC1 concentration. The XPF D731A -ERCC1 point 
mutation has been shown to abolish incision activity 
whilst retaining DNA binding (25) although some residual 
NER restoration activity has been noted in one study (30). 
The catalytically impaired XPF D73 i N -ERCCl showed con- 
siderably less incision activity than the wild- type endonucle- 
ase (Supplementary Figure SI A) although this was not 
completely abrogated at very high concentrations. 

Replacement of a non-bridging oxygen on the scissile 
phosphate (PO) with a phosphorothioate (PS) (Figure 
2B) eliminated nearly all of the endonuclease activity con- 
firming that the increase in signal was due to incisions by 
the enzyme at this position, in agreement with the 
gel-based assay (Figure 1C). 0.75 mM MnCl 2 was found 
to be optimal giving a 4-fold higher rate of incisions than 
the maximal MgCl 2 concentration of 5 mM in accordance 
with previous work (2,25,32) (Supplementary Figure SIB). 
The incision reaction was sensitive to pH, which peaked at 
pH 8 with a sharp fall-off in activity at pH 7.5 and 8.5 
(Supplementary Figure SIC). The kinetic constants for 
XPF-ERCC1 activity were then measured under 
steady-state conditions where substrate was in excess 
(Figure 2C). Representative Michaelis-Menten and 
Hanes-Woolf graphs are shown. The k CSLt and K m for 
human XPF-ERCC1 under these conditions was 
calculated to be 0.19/min ± 0.04 SE (standard error) and 
19.2 ± 0.76 nM, respectively, for three independent 
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experiments. This was consistent with a previous study 
where the k CSLt was found to be 0.17 ± 0.04 min -1 on the 
stem-loop substrate with a 12-bp duplex measured using a 
gel-based assay system (19). 

Local incision site sequence preferences 

Cleavage of synthetic substrates by XPF-ERCC1 has gen- 
erally been regarded as sequence independent, although 
preference for a pyrimidine 5' to the scissile phosphate 
at the incision site has been observed (32-36). In an 
effort to understand the basis for this apparent discrimin- 
ation, we used the fluorescence assay to measure the 
kinetics of cleavage of stem-loops where the nucleotides 
at positions X and Y, immediately upstream of the single 
scissile phosphate (Figure 3A), were substituted with each 
of the four bases and their corresponding complementary 
bases in the opposite strand. The tandem G:C base pairs 
at the junction were kept constant in order to maintain its 
stability. The base substitutions resulted in differences in 
K max , which ranged 18.7-fold from 8.4 to 157 RFU/min 



(Figure 3B). K m values only varied 3-fold indicating 
similar binding levels to all sequence variants at X and 
Y. (Figure 3B). As a control, an abasic site at Y was not 
cleaved at all (data not shown). 

Substitutions of the base at position X clearly have an 
impact on the incision rate (Figure 3B). There was a 2-fold 
preference for cleaving on the 3 ; -side of a pyrimidine at Y 
when X was a T. However, where X was A, C, or G there 
was a preference for a T at Y, but not for C. The presence 
of a G or C at position X resulted in ~2-fold reduction in 
the F max for each substitution at Y (Figure 3B). One 
possible reason for this difference is that the cleaved 
oligonucleotide product containing G/C may have 
dissociated from the duplex at a lower rate in the assay. 
To test for this, the products of a reaction were denatured 
and separated on a denaturing polyacrylamide gel 
(Figure 3C). After 15-min incubation [during the linear 
part of the reaction (Figure 2A)], the substrates containing 
A or T at X produced more cleavage products than those 
with G or C at X demonstrating that this was not an 



Tm = 29 ( 



XPF-ERCC1 

ctcgg 0fyr ' 



Fluorescein -afe'CAGCGCTCuu 90T ^ 
) *3'GTCGCGAGCC >y 



5'CAGCGCTC 
*3'GTCGC 



B 



25°C, 20 mM NaCl 



20 
18 
16 
14 
12 
V 10 



GG 



3'GTCGCGAGCC /U1 
n=23°C <Q±.D> 



PO 



PS 




0 500 1000 1500 2000 2500 3000 3500 

Time (seconds) 




0 20 40 60 80 100 120 140 160 
[Substrate] (nM) 




Vmax = 74.2 fmol/min 
Km = 19.2nM 
kcat= 0.190 min 



-20 0 20 40 60 80 100 120 140 160 
[Substrate] (nM) 



Figure 2. A microplate fluorescence-based assay to measure XPF-ERCC1 incision activity. (A) Left panel: principle of the fluorescence-based assay. 
RFU is relative fluorescence units. Incubation of intact stem-loop with XPF-ERCC1 in 0.75 mM MnCl 2 and 20 mM NaCl results in the cleaved 
oligonucleotide diffusing into solution at 25°C. Right panel: increase in fluorescence (RFU) with time. Reactions are shown in triplicate and 
contained: 3nM XPF-ERCC1, lOOnM standard stem-loop substrate, 0.75nM MnCl 2 , 0.5mM DTT and 20mM NaCl 2 in 50mM Tris buffer pH 
8 at 25°C. Solid squares: buffer only (B) Graph showing rate of incisions (v) of 62.5 nM stem-loop by 3nM XPF-ERCC1 where the scissile 
phosphate contains non-bridging oxygens (PO) or a non-bridging sulphur (PS). Average of three points ± SD. (C) Left panel: Michaelis-Menten 
plot showing rate versus substrate concentration for full-length XPF-ERCC1. Mean of three points. Right panel: Hanes-Woolf plot. 
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Figure 3. Local sequence preferences at the XPF-ERCC1 incision site 
on the substrate affects the rate of cleavage. (A) Stem-loop structure 
showing the positions of substituted bases X and Y in the duplex. Xc 
and Yc are the complementary bases to X and Y, A is used 
complementary to U. (B) Kinetic data where X and Y were substituted 
with each of the four bases. Data are mean of triplicate samples, ± 1 
standard error. (C) Incisions produced by 4.27 nM XPF-ERCC1 on 5' 
6-FAM-labelled stem-loop substrates with bases at X and Y as 
indicated. Arrow shows cleavage product. Reactions were incubated 
for 15min at 25°C and therefore did not run to completion. (D) 
Kinetic data where X and Y were substituted with the indicated 
bases. Data are mean of triplicate samples +/— standard error. 



artefact of the assay itself. Another possibility is that the 
enzyme may be separating the duplex to the base at X 
before cleavage and the presence of the more stable G/C 
pair at X may hinder this. XPF-ERCC1 cannot hydrolyse 
ATP so this would be a passive unwinding event, using the 
energy of binding of the enzyme to the substrate, or as a 
result of bending the substrate upon binding. To test this 
possibility, a mismatch was introduced opposite a T at X 



Table 1. Binding constants for XPF-ERCC1 binding to stem-loop 
and single-strand DNA 
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The Biacore evaluation software generated K a and K d by fitting the 
data to a 1:1 Langmuir interaction model. The equilibrium dissociation 
constant K u is K d /K a . 

by substituting Xc with G. The duplex containing the 
mismatch annealed to form the stem-loop structure and 
migrated in the correct position in a non-denaturing gel 
(data not shown). The T/G mismatch reduced the F max to 
75 RFU/min without a significant change in K m 
(16.2nM). This suggests that either an intact duplex is 
required at this position for efficient substrate recognition 
or cleavage, or the presence of a G at Xc is inhibitory, 
possibly because the 2-amino group of G projects into the 
minor groove of the duplex and may hinder access of an 
amino acid side chain in the catalytic site. 

Surprisingly, substitution of deoxyuridine at Y resulted 
in a 4.4-fold increase in F max over thymine, and at both X 
and Y resulted in a 5.2 increase in incision rate (Figure 3D) 
suggesting that exocyclic groups on C and T may be inhibit- 
ing entrance of these bases into a binding pocket. A 4-fold 
increase in K m also occurred indicating that efficient 
incision promoted release of the complex from its substrate. 
There was no difference in the incision rate when the C at 
position Y (and X = T) was replaced by 5-methyl C, which 
disrupts base stacking. Overall our data show that there 
was a 100-fold sequence-dependent variation in K max 
from 8.4 for CG to 906 RFU/min for UU. 

Full-length XPF-ERCC1 preferentially binds to the 
stem-loop structure 

The kinetics of stem-loop binding for full-length 
XPF-ERCC1 was measured by Biacore surface plasmon 
resonance. Five 2-fold dilutions from 30 nM were applied 
over an SA sensor chip with 1 1 RU of DNA bound. The 
data was analysed using the single-cycle kinetics wizard 
(Table 1). The high chi-square parameter indicates that 
the global 1:1 binding fit was not optimal. XPF-ERCC1 
did not bind to the duplex containing a 2T loop, which is 
consistent with the gel-based nuclease assay in Figure 1. 
The equilibrium dissociation constants in Table 1 show 
that binding to the stem-loop was 7-fold tighter than to 
the 20T single strand representing the loop (K& 
2.19 x 10 -9 and 1.37 x 10 -8 , respectively) which indicates 
that the single-strand binding is a dominant contributor to 
the substrate interaction of full-length XPF-ERCC1. 

The HLD of XPF is required for incision activity 

In order to study the contribution made by the XPF-HLD 
towards incision activity, an amino-terminal truncation 
mutant complex A640XPF-ERCC1 was constructed 
which lacked the entire XPF-HLD. This heterodimer 
consists of the nuclease and (HhH) 2 domains of XPF 
with a short portion of an amino-terminal linker 
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(30.7 kDa), dimerized with the full-length ERCC1 
polypeptide (32.6 kDa) (Figure 4A, top). The truncation 
produced a heterodimeric complex in much greater yields 
(~lmg/l of culture) than full-length XPF-ERCC1 and 
had relatively little aggregate (Figure 4 A, middle). No 
structure-specific activity was seen with the A6 4 0 XPF- 
ERCC1 mutant at either low or high enzyme concentra- 
tions (50 nM and 1.3 uM) (Figure 4A, bottom), even after 
16 h (data not shown). 

The HLD of XPF preferentially binds DNA structures 

Purified, recombinant XPF HLD (Figure 4B, top) bound 
stem-loop structures without regard to sequence at the 
incision site (Figure 4B, middle). At 50 mM NaCl, XPF- 
HLD bound stem loops with an apparent of 63 nM 
(Figure 4B, bottom) but much weaker binding to a 15-mer 
single-strand oligonucleotide and duplex DNA with a 
2T loop was observed (X D 263 and 217 nM, respectively). 
We conclude that XPF-HLD is necessary for measurable 
XPF-ERCC1 activity and that this requirement is due to 
its ability to bind the single strand-duplex junction. 

DISCUSSION 

Here we report a detailed characterization of human XPF- 
ERCC1 endonuclease using a model stem-loop substrate 



that enabled the minimal substrate requirements to be 
defined for a single site incision and binding of the XPF- 
ERCC1 complex to a DNA junction structure. We used 
these data to develop the first real-time fluorescence- 
based assay for human XPF-ERCC1 activity, with the 
potential to facilitate a high-throughput screening for 
small molecule modulators of XPF-ERCC1. The utility 
of such chemical inhibitors of XPF-ERCC1 would be to 
enhance the efficacy of treatment regimes involving 
cisplatin-based drugs (37,38). 

XPF-ERCC1 footprint on the duplex of the stem-loop 
structure 

Much of our understanding of substrate interaction by 
XPFs comes from structural and biochemical studies of 
archaeal homodimers that recognize and cleave DNA struc- 
tures with a downstream duplex such as 3 / -flaps and nicked 
Holliday junctions. Models derived from these studies 
predict that the (HhH) 2 domains of both XPF protomers 
bind ~ 5 bp of duplex DNA upstream and downstream from 
the junction, with a further 5 bp required to extend into the 
catalytic cleft (16,17). This accounts for the observed 
requirement for a 10 base-pair duplex for one incision to 
be made. However, the second hairpin of the human XPF 
(HhH) 2 domain is degenerate and no binding to DNA has 
been demonstrated. In contrast, the highly conserved 




nM XPF-HLD 

Figure 4. XPF-HLD is required for activity and preferentially binds stem-loops. (A) Top; gel-filtration profile of A640XPF-ERCC1 after separation 
on Superose 12 16/60. Line indicates fractions pooled for analysis. Bottom; 12% SDS-PAGE gel of fractions from gel-filtration column stained with 
Coomassie blue. Lanes show fractions from a Superose 12 16/60 column. Graph shows incisions by wild-type, catalytically impaired and A 640-XPF- 
ERCC1 complexes at 50 nM or 1.3 jiM, as indicated. (B) Top: 12% SDS-PAGE gel of purified recombinant XPF-HLD stained with Coomassie blue. 
Top graph: binding of purified XPF-HLD to stem loops with TT and GG substitutions at X and Y (Figure 3). Bottom graph: binding of XPF-HLD 
to stem loop with CT substitutions at X and Y, a 15-mer single-strand oligonucleotide, and duplex corresponding to the duplex in the stem-loop with 
a 2T loop, as indicated. 
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Figure 5. Proposed model for stem-loop substrate recognition by XPF-ERCC1. The XPF protomer has a red outline, and ERCC1, grey outline. 
The domain colouring is as Figure 1: (HhH) 2 domains; green, nuclease or nuclease-like domains; mauve and HLD; blue. (A) The XPF-HLD binds to 
the junction containing single-strand stretches greater than six bases and the hairpins of ERCC1 occupy five bases of the duplex. (B) A domain 
movement occurs resulting in bending of the DNA, and the nuclease domain of XPF engages the duplex downstream from the ERCC1 (HhH) 2 
domain. A pocket in the catalytic centre of XPF becomes occupied by the base 5' to the scissile phosphate, and cleavage occurs. (C) After incision 
the complex is released from the substrate. (D) Failure of cleavage due to lack of entry of an appropriate base into the binding pocket, results in the 
complex remaining bound and slow dissociation from the substrate. 



ERCC1 (HhH) 2 domain is capable of binding stem-loop 
DNA (18), and is likely to have retained the upstream 
duplex binding mode predicted by the A.pernix crystal 
structure (16). In addition, we have recently found a 
similar mode of double-strand DNA engagement for the 
homologous FancM-Faap24 complex (Coulthard et al., 
manuscript submitted). We anticipate that the 10-bp foot- 
print on the duplex seen here would comprise the ERCC1 
(HhH) 2 domain binding the upstream 5 bp combined with 
sufficient DNA to enter the catalytic site of XPF (Figure 5) 
as described in Tripsianes et al. (26). 

Role for XPF-HLD in catalytic activity and 
substrate binding 

In our study, highly pure preparations of recombinant 
A640 XPF- fu u i eng thERCCl complex lacking the HLD 
were found to be catalytically inactive. However, a 
previous study concluded that a shorter recombinant 
A666 XPF-A95ERCC1 complex (lacking the HLD and 
a connecting linker to the catalytic domain) exhibited 



some structure-specific nucleolytic activity in a gel-based 
assay, despite the absence of a control for non-specific 
nuclease activity (19). The discrepancy between our 
results and those of Tsodikov et al. (19) might be 
explained by the differences in the XPF truncation 
mutants used. One explanation could be that the acidic 
linker spanning residues (641-666) present in our A 640 
XPF-ERCC1 complex may inhibit activity by binding 
DNA or blocking the catalytic site. Alternatively, we 
cannot rule out the possibility that a contaminating bac- 
terial nuclease in A666 XPF-A95ERCC1 complex prep- 
arations was responsible for the observed nucleolytic 
activity given the long incubation periods (30 min to 1 1 h). 

Our findings suggest a model where the stem loop is 
bound and possibly distorted by the XPF-HLD into a 
more favourable conformation for engagement of the 
ERCC1 (HhH) 2 and XPF nuclease domains with the 
upstream duplex and the junction, respectively (Figure 5). 
Although XPF-HLD binds both single-strand and duplex 
DNA weakly, a 4-fold increased binding was observed on 
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the stem-loop substrate and consequently there may be two 
binding sites each accommodating duplex or single-strand 
DNA which co-operate to bind and possibly distort 
the DNA around the junction. The tighter binding dis- 
played by the full-length complex (X D 2 nM) could be due 
to co-operativity between the XPF-HLD (X D 60-70 nM, 
Figure 5) and the C-terminal complex, which have previ- 
ously been shown bind a stem-loop in an electromobility- 
shift assay with an apparent Kj) of 39 ± 13 uM (18). 

There are no structures available for the nuclease domain 
of XPF bound to DNA and little is known about how the 
DNA is led into the active site. This may be because 
substrate binding within the catalytic centre requires 
remodelling of the substrate or stabilization of the core 
endonuclease domains on the DNA by the XPF-HLD. A 
remodelling role has been described for crenarchael PCNA 
which is required to interact with XPF for activity, and 
co-operates with XPF to bind and kink the DNA (39). 
The structure of the A. pernix XPF bound to duplex 
DNA suggests that the duplex downstream from a nick 
would need to bend by 90° in order to make contact with 
the HhH 2 domains of both protomers (16). A similar mech- 
anism of DNA discontinuity recognition was demonstrated 
in the structures of Fen 1 and its homologue Exol bound to 
their substrates. These proteins recognized a distortion 
caused by a nick in duplex DNA and this enabled binding 
to both upstream and downstream regions via domains 
conserved in this superfamily, resulting in a bend angle of 
100° (40,41). The ATPase activity of the N-terminal 
helicase domain of Hef is stimulated by fork DNA, and 
its presence increases the incision activity of the core 
nuclease and (HhH) 2 domains (42). Since human XPF- 
HLD does not require ATP for incision activity, alterations 
to the DNA structure could result from DNA binding and 
subsequent protein domain movements. Stable binding and 
distortion of the DNA structure would allow limited 
movement of the core domains on the duplex so that the 
catalytic domain is positioned at an optimal sequence and 
would explain why multiple incisions are made in longer 
duplexes. Re-incision of the same substrate is unlikely to be 
responsible for incisions further upstream from the 
junction because the same pattern of cleavage is maintained 
on 3 / -labelled stem-loops (2,32). 

Substrate interactions at the XPF active site 

It is not known whether the DNA enters the XPF catalytic 
site as a duplex or is separated into single strands. Single- 
strand DNA-binding regions have been identified in both 
the XPF nuclease and ERCC1 nuclease-like domains, 
which could interact with the nucleotides downstream 
from the scissile phosphate (26,27). A hydrophobic patch 
was described in the A. pernix model, which could accom- 
modate single-strand DNA (16). These regions may 
stabilize an intermediate resulting from opening of the 
duplex from the junction so that the DNA enters the cata- 
lytic site as a single strand. The (HhH) 2 domains of both 
protomers of the Hef homodimer were shown to mediate 
opening of the duplex in a potassium permanganate foot- 
printing study containing tracts of TA base pairs at the 
junction of the structure (17). The structures of the Fenl 



and Exol S'-flap nucleases with their substrates revealed 
that a terminal T in the incised strand of the S'-flap is 
unpaired by superfamily conserved domains in order to 
engage the scissile phosphodiester bonds with the two 
metal ion catalytic site and seven highly conserved carb- 
oxylates (40,41). The Fenl homologues cleave 1 nt into 
the duplex, in contrast to XPF, which incises the phospho- 
diester bond between the second and the third nucleotide 
suggesting XPF would need to separate the terminal 2 bp. 
The duplex terminus of the substrates used for the Fenl 
structures contained a TA pair in contrast to the stem- 
loop exploited for XPF-ERCC1 which has a more stable 
GC pair. Opening of the duplex by the XPF-HLD might 
occur as a result of DNA bending, or by pushing a wedge or 
pin into the duplex terminus. However, we were unable to 
demonstrate opening at the junction by potassium perman- 
ganate footprinting or 2-aminopurine fluorescence studies 
(data not shown). Therefore, it cannot be ruled out that the 
duplex remains intact at the DNA junction and the scissile 
base is flipped out. In a previous study in which the duplex 
at the junction was crosslinked by a psoralen interstrand 
crosslink, incisions by XPF-ERCC1 showed that they 
occurred at the same position as uncrosslinked duplex sup- 
porting an intact duplex model (33). In this case, the duplex 
may enter the active site intact and become distorted in such 
a way as to flip a base into a binding pocket in the XPF 
nuclease domain. This would explain the apparent reliance 
of incision rate on the stability of the base pair at X. The 
bases at X/Xc may be separated by a component of the XPF 
catalytic domain entering the duplex to flip out the base. A 
candidate for this is the R689 of XPF, which is absolutely 
required for activity (25). The equivalent R26 in A. pernix 
was part of a functionally important region of the catalytic 
site and not directly involved in catalysis or protein folding 
(16). It was postulated to bind to assist in anchoring the 
backbone of the DNA at the junction but it may be import- 
ant for intercalating into the duplex and distortion of the 
duplex or base flipping. 

Implications for cellular XPF-ERCC1 substrates 

It is unclear what role sequence preferences displayed here 
by XPF-ERCC1 might play in NER as filling in of the 
excised patch replaces lost bases, but it may be significant 
for other processes that XPF-ERCC1 is involved with in 
the cell. For instance, it may serve to regulate the activity 
of XPF-ERCC1 on GC rich regions of the genome, or 
have a role in directing incisions of the TTAGGG 3'- 
telomere overhangs. In a cellular context, structures that 
XPF-ERCC1 cleaves includes bubbles, flaps, splayed arm 
and overhangs and possibly more complex structures de- 
pending on the process. The local cellular conditions will 
affect the activity of XPF-ERCC1 in vivo and evidently it 
co-operates with many other protein partners in carrying 
out its endonucleolytic function in NER, ICL repair and 
other genome maintenance processes. The accepted model 
stem-loop substrate used in our present studies has 
enabled us to examine the conditions under which one 
incision was made and appears to be a good surrogate 
for the distorted physiological junction substrates pre- 
sented to XPF-ERCC1 within a cell nucleus. 
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